智能论文笔记

SGM-Net: Semantic Guided Matting Net

Qing Song , Wenfeng Sun , Donghan Yang , Mengjie Hu , Chun Liu

分类：计算机视觉

2022-08-16

人类垫子是指从具有高质量的自然图像中提取人类部位，包括人类细节信息，例如头发，眼镜，帽子等。这项技术在电影行业的图像合成和视觉效果中起着至关重要的作用。当绿屏不可用时，现有的人类底漆方法需要其他输入（例如Trimap，背景图像等）或具有较高计算成本和复杂网络结构的模型，这给应用程序带来了很大的困难实践中的人类垫子。为了减轻此类问题，大多数现有方法（例如MODNET）使用多分支为通过细分铺平道路，但是这些方法并未充分利用图像功能，并且仅利用网络的预测结果作为指导信息。因此，我们提出了一个模块来生成前景概率图，并将其添加到MODNET中以获得语义引导的Matting Net（SGM-NET）。在只有一个图像的条件下，我们可以实现人类的效果任务。我们在P3M-10K数据集上验证我们的方法。与基准相比，在各种评估指标中，我们的方法显着改善。

translated by 谷歌翻译

Using EBGAN for Anomaly Intrusion Detection

Yi Cui , Wenfeng Shen , Jian Zhang , Weijia Lu , Chuang Liu , Lin Sun , Si Chen

分类：人工智能

2022-06-21

作为一种主动网络安全保护方案，入侵检测系统（IDS）承担以恶意网络流量形式检测网络攻击的重要责任。入侵检测技术是ID的重要组成部分。目前，许多学者已经对入侵检测技术进行了广泛的研究。但是，为大规模网络流量数据开发有效的入侵检测方法仍然很困难。由于生成的对抗网络（GAN）具有强大的建模功能，可用于复杂的高维数据，因此它们为解决此问题提供了新的想法。在本文中，我们提出了一种基于Ebgan的入侵检测方法IDS-Ebgan，该方法将网络记录归类为正常流量或恶意流量。 IDS-Ebgan中的发电机负责将培训中的原始恶意网络流量转换为对抗性恶意示例。这是因为我们想使用对抗性学习来提高歧视者检测恶意流量的能力。同时，鉴别器采用自动编码器模型。在测试过程中，IDS-Ebgan使用歧视器的重建错误来对流量记录进行分类。

translated by 谷歌翻译

Gradient-Based Meta-Learning Using Uncertainty to Weigh Loss for Few-Shot Learning

Lin Ding , Peng Liu , Wenfeng Shen , Weijia Lu , Shengbo Chen

分类：机器学习

2022-08-17

模型不合时宜的元学习（MAML）是最成功的元学习技术之一。它使用梯度下降来学习各种任务之间的共同点，从而使模型能够学习其自身参数的元定义，以使用少量标记的培训数据快速适应新任务。几次学习的关键挑战是任务不确定性。尽管可以从具有大量任务的元学习中获得强大的先验，但是由于训练数据集的数量通常太小，因此无法保证新任务的精确模型。在这项研究中，首先，在选择初始化参数的过程中，为特定于任务的学习者提出了新方法，以适应性地学习选择最小化新任务损失的初始化参数。然后，我们建议对元损失部分的两种改进的方法：方法1通过比较元损失差异来生成权重，以提高几个类别时的准确性，而方法2引入了每个任务的同质不确定性，以根据多个损失，以基于多个损失。原始的梯度下降是一种增强新型类别的概括能力的方式，同时确保了准确性的提高。与以前的基于梯度的元学习方法相比，我们的模型在回归任务和少量分类中的性能更好，并提高了模型的鲁棒性，对元测试集中的学习率和查询集。

translated by 谷歌翻译

A Hybrid Deep Feature-Based Deformable Image Registration Method for Pathological Images

Chulong Zhang , Yuming Jiang , Na Li , Zhicheng Zhang , Md Tauhidul Islam , Jingjing Dai , Lin Liu , Wenfeng He , Wenjian Qin , Jing Xiong

分类：人工智能

2022-08-16

病理学家需要结合不同染色病理切片的信息，以获得准确的诊断结果。可变形图像配准是融合多模式病理切片的必要技术。本文提出了一个基于混合特征的基于特征的可变形图像登记框架，用于染色的病理样品。我们首先提取密集的特征点，并通过两个深度学习功能网络执行匹配点。然后，为了进一步减少虚假匹配，提出了一种结合隔离森林统计模型和局部仿射校正模型的异常检测方法。最后，插值方法基于上述匹配点生成用于病理图像注册的DVF。我们在非刚性组织学图像注册（ANHIR）挑战的数据集上评估了我们的方法，该挑战与IEEE ISBI 2019会议共同组织。我们的技术的表现使传统方法的平均水平注册目标误差（RTRE）达到0.0034。所提出的方法实现了最先进的性能，并在评估测试数据集时将其排名1。提出的基于特征的混合特征的注册方法可能会成为病理图像注册的可靠方法。

translated by 谷歌翻译

PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking

Yixuan Qiao , Hao Chen , Jun Wang , Yongquan Lai , Tuozhen Liu , Xianbin Ye , Xin Tang , Rui Fang , Peng Gao , Wenfeng Xie

分类：自然语言处理

2022-05-18

This paper describes the PASH participation in TREC 2021 Deep Learning Track. In the recall stage, we adopt a scheme combining sparse and dense retrieval method. In the multi-stage ranking phase, point-wise and pair-wise ranking strategies are used one after another based on model continual pre-trained on general knowledge and document-level data. Compared to TREC 2020 Deep Learning Track, we have additionally introduced the generative model T5 to further enhance the performance.

translated by 谷歌翻译

Improving Structured Text Recognition with Regular Expression Biasing

Baoguang Shi , Wenfeng Cheng , Yijuan Lu , Cha Zhang , Dinei Florencio

分类：计算机视觉

2021-11-10

我们研究识别结构化文本的问题，即跟随某些格式的文本，并建议通过指定偏见的正则表达式（正则表达）来提高结构化文本的识别准确性。偏置识别器识别与在其他文本上一般小的降级的成本中具有显着提高的指定正则表达式的文本。通过将正数表达式作为加权有限状态换能器（WFST）建模并通过动态替换将其注入解码器中来实现偏置。单个纤维率控制偏置强度。该方法可用于识别具有已知格式的文本行或包含来自域词汇的单词。示例包括驾驶执照号码，处方药中的药物名称等。我们展示了Regex偏置在印刷和手写的结构文本数据集上的功效，并测量其副作用。

translated by 谷歌翻译

Hyperspectral Image Denoising via Multi-modal and Double-weighted Tensor Nuclear Norm

Sheng Liu , Xiaozhen Xie , Wenfeng Kong

分类：计算机视觉

2021-01-19

高光谱图像（HSIS）通常遭受不同类型的污染。这严重降低了HSI的质量，并限制了后续处理任务的准确性。 HSI DeNoising可以建模为低级张量降解问题。张量奇异值分解引起的张量核定标（TNN）在此问题中起重要作用。在这封信中，我们首先重新考虑了TNN中的三个不起眼但至关重要的现象。在HSI的傅立叶变换域中，不同的频率切片（FS）包含不同的信息。每个FS的不同单数值（SV）也代表不同的信息。这两个物理现象不仅处于光谱模式，而且位于空间模式下。然后，基于它们，我们提出了一个多模式和双重加权的TNN。它可以根据所有HSIS的身体含义适应FS和SVS。在乘数的交替方向方法的框架中，我们设计了一种有效的交替迭代策略来优化我们提出的模型。对合成和真实HSI数据集进行了剥落的实验证明了它们在相关方法上的优势。

translated by 谷歌翻译

Cross Modal Transformer via Coordinates Encoding for 3D Object Dectection

Junjie Yan , Yingfei Liu , Jianjian Sun , Fan Jia , Shuailin Li , Tiancai Wang , Xiangyu Zhang

分类：计算机视觉

2023-01-03

In this paper, we propose a robust 3D detector, named Cross Modal Transformer (CMT), for end-to-end 3D multi-modal detection. Without explicit view transformation, CMT takes the image and point clouds tokens as inputs and directly outputs accurate 3D bounding boxes. The spatial alignment of multi-modal tokens is performed implicitly, by encoding the 3D points into multi-modal features. The core design of CMT is quite simple while its performance is impressive. CMT obtains 73.0% NDS on nuScenes benchmark. Moreover, CMT has a strong robustness even if the LiDAR is missing. Code will be released at https://github.com/junjie18/CMT.

translated by 谷歌翻译

KoopmanLab: A PyTorch module of Koopman neural operator family for solving partial differential equations

Wei Xiong , Muyuan Ma , Pei Sun , Yang Tian

分类：机器学习

2023-01-03

Given the increasingly intricate forms of partial differential equations (PDEs) in physics and related fields, computationally solving PDEs without analytic solutions inevitably suffers from the trade-off between accuracy and efficiency. Recent advances in neural operators, a kind of mesh-independent neural-network-based PDE solvers, have suggested the dawn of overcoming this challenge. In this emerging direction, Koopman neural operator (KNO) is a representative demonstration and outperforms other state-of-the-art alternatives in terms of accuracy and efficiency. Here we present KoopmanLab, a self-contained and user-friendly PyTorch module of the Koopman neural operator family for solving partial differential equations. Beyond the original version of KNO, we develop multiple new variants of KNO based on different neural network architectures to improve the general applicability of our module. These variants are validated by mesh-independent and long-term prediction experiments implemented on representative PDEs (e.g., the Navier-Stokes equation and the Bateman-Burgers equation) and ERA5 (i.e., one of the largest high-resolution data sets of global-scale climate fields). These demonstrations suggest the potential of KoopmanLab to be considered in diverse applications of partial differential equations.

translated by 谷歌翻译

Ranking Differential Privacy

Shirong Xu , Will Wei Sun , Guang Cheng

分类： (统计)机器学习 | 机器学习

2023-01-02

Rankings are widely collected in various real-life scenarios, leading to the leakage of personal information such as users' preferences on videos or news. To protect rankings, existing works mainly develop privacy protection on a single ranking within a set of ranking or pairwise comparisons of a ranking under the $\epsilon$-differential privacy. This paper proposes a novel notion called $\epsilon$-ranking differential privacy for protecting ranks. We establish the connection between the Mallows model (Mallows, 1957) and the proposed $\epsilon$-ranking differential privacy. This allows us to develop a multistage ranking algorithm to generate synthetic rankings while satisfying the developed $\epsilon$-ranking differential privacy. Theoretical results regarding the utility of synthetic rankings in the downstream tasks, including the inference attack and the personalized ranking tasks, are established. For the inference attack, we quantify how $\epsilon$ affects the estimation of the true ranking based on synthetic rankings. For the personalized ranking task, we consider varying privacy preferences among users and quantify how their privacy preferences affect the consistency in estimating the optimal ranking function. Extensive numerical experiments are carried out to verify the theoretical results and demonstrate the effectiveness of the proposed synthetic ranking algorithm.

translated by 谷歌翻译